Two Principles and Six Techniques for Rapid MT Development

Authors

  • Sergei Nirenburg
  • Stephen Beale
  • Stephen Helmreich
  • Kavi Mahesh
  • Evelyne Viegas
  • Remi Zajac
Abstract

In this paper we describe a range of techniques used at NMSU CRL for accelerating the development of MT systems. These techniques enable semi-automatic development of several components of a multilingual MT system, and thus rapid deployment of MT capabilities in a new language. First, we describe the core multi-engine, multilingual architecture that allows the different techniques to be rapidly integrated into an MT system. We show how off-the-shelf components were used in this architecture for fast development. Then we illustrate a set of techniques for semi-automatic acquisition of static resources: (a) automatic induction of grammars, (b) corpus-based acquisition of bilingual glossaries, and automatic acquisition of semantic lexicons through (c) lexical rules and (d) reversal of analysis lexicons to generation lexicons. Finally, we describe an automatic testing environment that enables rapid validation of automatically acquired resources.

1 Rapid Development Techniques

Static knowledge sources (grammars, lexicons, world knowledge bases) are the most time-consuming components to build in any rule-based machine translation system. It is, therefore, imperative to find ways of speeding up the creation and updating of high-quality, useful static knowledge sources. It is equally imperative to rely on a robust and flexible core computational architecture that allows the concurrent manipulation of a large number of static and dynamic knowledge sources as well as documents and document collections. In this paper, we describe several techniques for facilitating rapid development of MT capabilities for a new language in the framework of an existing multilingual system.
Our approach is based on the following two principles:

  • Heterogeneous, Multi-Engine, Multilingual Architecture: a multi-engine architecture, in which different subsets of MT techniques can be combined for different languages, accelerates development; perfecting any one prespecified MT method for a new language takes longer to deliver comparable initial capabilities.
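The idea of combining different subsets of MT techniques for different languages can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the engine names, the `ENGINES` table, and the first-success fallback strategy are hypothetical and are not taken from the system described here, which may combine engine outputs in a quite different way.

```python
from typing import Callable, Dict, List, Optional

# An engine maps source text to a translation, or None if it declines the input.
Engine = Callable[[str], Optional[str]]


def glossary_engine(glossary: Dict[str, str]) -> Engine:
    """Hypothetical engine: word-for-word lookup in a bilingual glossary."""
    def run(text: str) -> Optional[str]:
        words = text.lower().split()
        if words and all(w in glossary for w in words):
            return " ".join(glossary[w] for w in words)
        return None  # decline when the input is empty or any word is unknown
    return run


def passthrough_engine(text: str) -> Optional[str]:
    """Hypothetical last-resort engine: return the source text unchanged."""
    return text


# Assumption: each language is configured with its own ordered subset of engines.
ENGINES: Dict[str, List[Engine]] = {
    "es": [glossary_engine({"hola": "hello", "mundo": "world"}), passthrough_engine],
    "xx": [passthrough_engine],  # a new language starts with whatever is available
}


def translate(text: str, source_lang: str) -> str:
    """Try the engines configured for the language in order; use the first result."""
    for engine in ENGINES.get(source_lang, [passthrough_engine]):
        result = engine(text)
        if result is not None:
            return result
    return text
```

In this sketch a new language can be given initial coverage by registering only the engines whose resources already exist, and stronger engines can be added to its list later without changing `translate`.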

Publication date: 1996